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General Achievement Trends — Arizona 

K- 12 enrollment — 1, 148,448 



The raw data used to develop these state profiles, including data for additional grade levels and years before 2002, can be found 
on the CEP Web site at www.cep-dc.org . Click on the link on the left for No Child Left Behind. In the Document Library, look for 
the most recent report on student achievement since 2002. Below the name of the report, click on the link for View State Profiles 
and Worksheets. Scroll down the page, and click on the Worksheet links for any state. 



Overall Achievement — Key Findings 

General results 

The tables in this profile present state test results in reading and math at three achievement levels (basic, proficient, and advanced) and at one 
grade each at the elementary, middle, and high school levels. These data are more complete than the percentage of students scoring proficient 
that is the main indicator used to determine adequate yearly progress under the No Child Left Behind Act. 

In general, Arizona students made gains at the basic and proficient achievement levels. Results were mixed at the advanced level. 

Specific results 

• Between 2005 and 2008, the percentage of students performing at the basic level and above in reading increased slightly at the 
elementary and high school levels and remained stable at the middle school level. In math, the percentage basic and above rose slightly 
at all three grade levels analyzed. 

• The percentage of students scoring at the proficient level and above in reading went up at a moderate-to-large rate at the elementary and 
middle school grades analyzed and increased slightly at the high school level. In math, the percentage proficient grew at a moderate-to- 
large rate at the elementary level, but remained relatively flat at the middle and high school levels. 

• The percentage of students reaching the advanced level in reading rose slightly at the middle school grade analyzed but dropped slightly 
at the elementary and high school levels. In math, Arizona made moderate-to-large gains in the percentage scoring advanced at the 
elementary level but saw a moderate-to-large decrease at the high school level. 
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Data Limitations 

2005 through 2008 
2005 through 2008 



Years of comparable percentage proficient data 
Years of data needed to compute effect sizes 



Test Characteristics 

The characteristics highlighted below are for the state reading and mathematics tests used for accountability under the No Child Left Behind Act 
(NCLB). 

Test(s) used for NCLB accountability Arizona's Instrument to Measure Standards Dual Purpose Assessment 

(AIMS DPA), grades 3-8 

Arizona’s Instrument to Measure Standards High School (AIMS HS) 
Arizona's Instrument to Measure Standards-Alternate (AIMS-A) 

Grades 3-8 and 10-12 (first administration of high school exam in 
grade 10, plus retake opportunities in grades 1 1 and 12) 

AZ uses four achievement levels: Falls Far Below the Standard, 
Approaches the Standard, Meets the Standard, and Exceeds the 
Standard. For our analyses we treated Approaches the Standard as 
Basic, Meets the Standard as Proficient, and Exceeds the Standard 
as Advanced. 

Yes 

2005 

Spring (fall window also available for AIMS HS) 

2005-06: Grades 4, 6, and 7 included in achievement profiles 
2005: Cut points reset 
2005: Change in test contractors 



Grades tested for NCLB accountability 
State labels for achievement levels 

High school NCLB test also used as an exit exam? 

First year test used 

Time of test administration 

Major changes in testing system (2002-present) 
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Overall Achievement — Percentages Proficient 

Figure AZ-1. Percentage of Students Scoring at the Proficient Levei and Above in Reading 
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Table AZ-1. Percentage of Students Scoring at the Proficient Level and Above in Reading 



Reporting Year Pre-NCLB Post-NCLB 

Grade Average Yearly Average Yearly 



Level 


1999 2000 


2001 


2002 2003 


2004 2005 


2006 


2007 


2008 


Percentage Point Gain 
1999-2002^ 


Percentage Point Gain 
2002-2008^ 


Grade 3 








65% 


67% 


69% 


69% 


NA 


1.3 


Grade 4 








64% 


65% 


65% 


70% 


NA 


2.0 


Grade 5 








67% 


67% 


70% 


68% 


NA 


0.3 


Grade 6 








65% 


65% 


67% 


68% 


NA 


1.0 


Grade 7 








66% 


65% 


67% 


70% 


NA 


1.3 


Grade 8 








64% 


63% 


63% 


67% 


NA 


1.0 


Grade 10 








73% 


72% 


72% 


74% 


NA 


0.3 



Table reads: The percentage of 3'^'^ graders who scored at the proficient level and above on the state reading test increased from 65% in 2005 to 69% in 2008. The 
average yearly gain in the percentage proficient in grade 3 reading was 1 .3 percentage points per year after NCLB was enacted. 

^Averages are subject to rounding error. 
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Figure AZ-2. Percentage of Students Scoring at the Proficient Levei and Above in Mathematics 
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Table AZ-2. Percentage of Students Scoring at the Proficient Level and Above in Mathematics 



Reporting Year Pre-NCLB Post-NCLB 

Grade Average Yearly Average Yearly 



Level 


1999 2000 


2001 


2002 2003 


2004 2005 


2006 


2007 


2008 


Percentage Point Gain 
1999-2002^ 


Percentage Point Gain 
2002-2008^ 


Grade 3 








72% 


72% 


72% 


71% 


NA 


-0.3 


Grade 4 








71% 


73% 


74% 


74% 


NA 


1.0 


Grade 5 








68% 


68% 


69% 


70% 


NA 


0.7 


Grade 6 








63% 


62% 


65% 


67% 


NA 


1.3 


Grade 7 








66% 


68% 


69% 


71% 


NA 


1.7 


Grade 8 








61% 


60% 


61% 


61% 


NA 


0.0 


Grade 10 








68% 


66% 


67% 


68% 


NA 


0.0 



Table reads: The percentage of 3'^'^ graders who scored at the proficient level and above on the state math test decreased from 72% 2005 to 71% in 2008. The 
average yearly decline in the percentage proficient in grade 3 math was 0.3 percentage points per year after NCLB was enacted. 

^Averages are subject to rounding error. 
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Overall Achievement — Percentages Advanced, Proficient, and Basic 

How to read figures 3 and 4 and tables 3 and 4 

The stacked bars in figures 3 and 4 show the percentages of students scoring at the basic, proficient, and advanced ievels on the state tests used 
for NCLB accountabiiity. The foiiowing information may be heipfui in interpreting the figures: 

• The percentage proficient and above — the benchmark used to determine adequate yearly progress under NCLB — is the sum of the middle 
and top segments of the bars (percentage proficient plus percentage advanced). 

• The percentage basic and above is the sum of all three segments of the bars (percentage basic plus percentage proficient plus 
percentage advanced). 

• The sums that result from adding the segments of the bars in these ways correspond with the percentages proficient and above, and basic 
and above, shown in tables 3 and 4. In a few instances, however, the sums in the figures may differ from those in the tables by a 
percentage point due to rounding. 

• The bars do not total 1 00% because students who score below the basic level are not displayed. 

• By looking at the percentages in each segment of the bars, one can see how achievement trends at the three levels interact. Ideally, one 
would want to see increases at all three levels, as more students move from below basic to basic achievement, from basic to proficient, 
and from proficient to advanced. But other scenarios may also be illuminating. For example, gains may occur in the percentage basic even 
if the percentage proficient and above has stayed the same, suggesting that progress has been made in moving students from the below 
basic to the basic level. Or, if the percentage proficient has grown while the percentages basic and advanced have shrunk, this suggests 
most of the academic attention was focused on moving “bubble kids” from the basic to proficient levels, with little or no attention to the 
highest- and lowest-performing students. 

• Some states use different labels for their achievement levels instead of basic, proficient, and advanced. The specific state labels are listed 
in the Test Characteristics section at the beginning of this profile. 
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Figure AZ-3. Percentages of Students Scoring at the Advanced, Proficient, and Basic Leveis in Reading 



Grade 4 



Grade 8 



100 

90 

80 

70 

60 

50 

40 

30 

20 

10 



10 



55 



24 



57 



24 



10 



55 



24 



□ Basic □ Protioient ■ Advanced 



60 



21 



2002 2003 2004 2005 2006 2007 2008 

School Year 



100 

90 

80 

70 

60 

50 

40 

30 

20 

10 



58 



25 



58 



26 



56 



26 



60 



22 



2002 2003 2004 2005 2006 2007 2008 

School Year 



□ Basic □ Proficient ■ Advanced 



100 

90 

80 

70 

60 

50 

40 

30 

20 

10 



Grade 10 



64 



20 



64 



22 



65 



20 



66 



21 



2002 2003 2004 2005 2006 

School Year 



2007 



2008 



□ Basic □ Proficient ■ Advanced 



2009 



GENERAL ACHIEVEMENT TRENDS — Arizona 



Table AZ-3. Percentages of Students Scoring at the Advanced, Proficient and Above, and Basic and Above Levels in Reading 









Reporting Year 








Average Yearly 
Percentage Point Gain^ 


Achievement Level 


2002 


2003 


2004 2005 


2006 


2007 


2008 


Grade 4 


Advanced 






10% 


8% 


10% 


9% 


-0.3 


Proficient and Above 






64% 


65% 


65% 


70% 


2.0 


Basic and Above 






89% 


89% 


89% 


90% 


0.3 


Grade 8 


Advanced 






6% 


5% 


7% 


7% 


0.3 


Proficient and Above 






64% 


63% 


63% 


67% 


1.0 


Basic and Above 






89% 


89% 


89% 


89% 


0.0 


Grade 10 


Advanced 






9% 


8% 


7% 


7% 


-0.7 


Proficient and Above 






73% 


72% 


72% 


74% 


0.3 


Basic and Above 






93% 


94% 


92% 


94% 


0.3 



Table reads: The percentage of 4*^ graders who scored at the advanced level on their state reading test decreased from 10% in 2005 to 9% in 2008. During this 
period, the average yearly loss in the percentage advanced was 0.3 percentage points per year in grade 4 reading. 

^Averages are subject to rounding error. 
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Figure AZ-4. Percentages of Students Scoring at the Advanced, Proficient, and Basic Leveis in Mathematics 
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Table AZ-4. Percentages of Students Scoring at the Advanced, Proficient and Above, and Basic and Above Levels in Mathematics 









Reporting Year 








Average Yearly 
Percentage Point Gain^ 


Achievement Level 


2002 


2003 


2004 2005 


2006 


2007 


2008 


Grade 4 


Advanced 






21% 


24% 


27% 


25% 


1.3 


Proficient and Above 






71% 


73% 


74% 


74% 


1.0 


Basic and Above 






88% 


90% 


89% 


89% 


0.3 


Grade 8 


Advanced 






13% 


13% 


14% 


13% 


0.0 


Proficient and Above 






61% 


60% 


61% 


61% 


0.0 


Basic and Above 






79% 


80% 


79% 


80% 


0.3 


Grade 10 


Advanced 






18% 


14% 


13% 


14% 


-1.3 


Proficient and Above 






68% 


66% 


67% 


68% 


0.0 


Basic and Above 






79% 


78% 


79% 


80% 


0.3 



Table reads: The percentage of 4*^ graders who scored at the advanced level on their state math test increased from 21% in 2005 to 25% in 2008. During this 
period, the average yearly gain in the percentage advanced was 1.3 percentage points per year in grade 4 math. 

^Averages are subject to rounding error. 
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Overall Achievement — Effect Sizes 

How to read figures 5 and 6 and tables 5 and 6 

An effect size is a statisticai tool that conveys the amount of difference between test results using a common unit of measurement which does not 
depend on the scoring scale for a particular test. An effect size is computed by subtracting the mean scale score (the average score) on a test for 
one year, such as 2006, from the mean scale score for another year, such as 2007, then dividing the result by the average standard deviation. 
(The standard deviation is a measure of how much test scores tend to deviate from the mean — in other words, how spread out or bunched 
together scores are.) If the mean score has not changed, then the effect size is 0. An effect size of +1 indicates an increase of 1 standard 
deviation from the previous year’s mean score. Effect sizes can also be used to calculate differences in scores between two subgroups of 
students. 

Tables 5 and 6 show mean scale scores, standard deviations, and the accumulated annual effect size (AAES), which is the cumulative gain in 
effect size over a range of years. For example, to determine the accumulated annual effect size between 2006 and 2008, one would calculate the 
change in effect size from 2006 to 2007, and from 2007 to 2008, then add the results together. In figures and tables 5 and 6, 2002 (or the closest 
year with comparable data) was used as a starting point (0.00) to calculate accumulated annual effect sizes after NCLB was enacted (and before, 
if available). Steady gains in AAES are represented by negative numbers before 2002 rising to positive numbers after 2002, so that pre- and post- 
NCLB trends can be shown on the same trend line. A positive AAES before 2002 or a negative AAES after 2002 indicates a decline in 
performance over time. 




Accumulated Annual Effect Sizes 
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Figure AZ-5. Reading Achievement Trends in Terms of Effect Sizes 
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Table AZ-5. Reading Achievement Trends in Terms of Effect Sizes 













Reporting Year 










Pre-NCLB 

Average 


Post-NCLB 

Average 


Level 




1999 


2000 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


2008 


Yearly Effect 
Size Gain 
1999-2002^ 


Yearly Effect 
Size Gain 
2002-2008^ 


Grade 4 


MSS 

(SD) 














469 

(53) 


466 

(63) 


467 

(66) 


470 

(67) 








AAES 












0.00 


-0.05 


-0.04 


0.01 


NA 


0.00 


Grade 8 


MSS 

(SD) 














517 

(52) 


514 

(71) 


516 

(75) 


518 

(76) 








AAES 












0.00 


-0.05 


-0.02 


0.01 


NA 


0.00 


Grade 1 0 


MSS 

(SD) 














699 

(55) 


697 

(84) 


697 

(83) 


697 

(81) 








AAES 












0.00 


-0.03 


-0.03 


-0.03 


NA 


-0.01 



Table reads: The mean scale score (MSS) of 4*^ graders on the state reading test increased from 469 in 2005 to 470 in 2008. The standard deviation (SD) for the 
mean scale score in 2005 was 53. Using 2005 as a starting point (0.00), the accumulated annual effect size (AAES) for grade 4 reading totaled 0.01 by 2008. For 
the post-NCLB period, the average yearly gain in effect size at grade 4 was 0.00. 

Note: Arizona's Instrument to Measure Standards Dual Purpose Assessment for grades 3-8 is scored on a scale of 200-800. The AIMS High School assessment is 
scored on a scale of 500-900. 

^Averages are subject to rounding error. 






Accumulated Annual Effect Sizes 
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Figure AZ-6. Mathematics Achievement Trends in Terms of Effect Sizes 
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Table AZ-6. Mathematics Achievement Trends in Terms of Effect Sizes 



Grade 












Reporting Year 










Pre-NCLB 

Average 


Post-NCLB 

Average 


Level 




1999 


2000 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


2008 


Yearly Effect 
Size Gain 
1999-2002^ 


Yearly Effect 
Size Gain 
2002-2008^ 


Grade 4 


MSS 

(SD) 














477 

(53) 


479 

(67) 


481 

(70) 


480 

(70) 








AAES 














0.00 


0.03 


0.06 


0.05 


NA 


0.02 


Grade 8 


MSS 

(SD) 














553 

(58) 


550 

(78) 


552 

(80) 


552 

(80) 








AAES 














0.00 


-0.04 


-0.02 


-0.02 


NA 


-0.01 


Grade 1 0 


MSS 

(SD) 














704 

(52) 


695 

(81) 


697 

(78) 


698 

(78) 








AAES 














0.00 


-0.13 


-0.11 


-0.09 


NA 


-0.03 



Table reads: The mean scale score (MSS) of 4*^ graders on the state math test increased from 477 in 2005 to 480 in 2008. The standard deviation (SD) for the 
mean scale score in 2005 was 53. Using 2005 as a starting point (0.00), the accumulated annual effect size (AAES) for grade 4 math totaled 0.05 by 2008. For the 
post-NCLB period, the average yearly gain in effect size at grade 4 was 0.02. 

Note: Arizona's Instrument to Measure Standards Dual Purpose Assessment for grades 3-8 is scored on a scale of 200-800. The AIMS High School assessment is 
scored on a scale of 500-900. 

^Averages are subject to rounding error. 
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Key Terms 

Percentage proficient (and above) — The percentage of students in a group who score at and above the cut score for “proficient” performance on 
the state test used to determine progress under NCLB. The Act requires states to report student test performance in terms of at least three 
achievement levels: basic, proficient, and advanced. Adequate yearly progress determinations are based on the percentage of students scoring at 
the proficient level and above. 

Percentage basic (and above) — The percentage of students in a group who score at and above the cut score for “basic” performance on the 
state test used to determine progress under NCLB. 

Percentage advanced — The percentage of students in a group who reach or exceed the cut score for “advanced” performance on the state test 
used to determine progress under NCLB. 

Moderate-to-iarge gain — For the percentage basic, proficient, or advanced, an average gain of 1 or more percentage points per year. For effect 
size, an average gain of 0.02 or greater per year. 

Siightgain — For the percentage basic, proficient, or advanced, an average gain of less than 1 percentage point per year. For effect size, an 
average gain of less than 0.02 per year. 

Moderate-to-iarge deciine — For the percentage basic, proficient, or advanced, an average decline of 1 or more percentage points per year. For 
effect size, an average decline of 0.02 or greater per year. 

Siight deciine — For the percentage basic, proficient, or advanced, an average decline of less than 1 percentage points per year. For effect size, 
an average decline of less than 0.02 per year. 

Effect size — A statistical tool that conveys the amount of difference between test results using a common unit of measurement which does not 
depend on the scoring scale for a particular test. 

Accumuiated annuai effect size — The cumulative gain in effect size over a range of years. 

Mean scaie score — The arithmetical average of a group of test scores, expressed on a common scale for a particular state’s test. The mean is 
calculated by adding the scores and dividing the sum by the number of scores. 

Standard deviation — A measure of how much test scores tend to deviate from the mean — in other words, how spread out or bunched together 
test scores are. If students’ scores are bunched together, with many scores close to the mean, then the standard deviation will be small. If scores 
are spread out, with many students scoring at the high or low ends of the scale, then the standard deviation will be large. 
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Cautions and Explanations 

Different labels for achievement levels — For consistency, aii of the state profiies deveioped for this report use a common set of iabeis (basic, 
proficient, and advanced) for the main achievement ieveis required by NCLB. In practice, however, some states may use different iabeis, such as 
“meets standard” instead of proficient, and some states have estabiished additionai achievement ieveis beyond those required by NCLB. 

Different names for subgroups — For the sake of consistency and ease of data tabulation, all of the state profiles developed for this report use a 
common set of names for the major student subgroups. In practice, however, states use various names for subgroups that may differ from those 
used here (such as using “Hispanic” instead of “Latino,” or “special education students” instead of “students with disabilities”). Moreover, a few 
states separately track the performance of subgroups not included in the analyses for this report. 

Special caution for students with disabilities and English language learners — T rends for students with disabilities and English language learners 
should be interpreted with caution because changes in federal guidance and state accountability plans may have altered which students in these 
subgroups are tested for accountability purposes, how they are tested, and when their test scores are counted as proficient under NCLB. These 
factors could affect the year-to-year comparability of test results. 

Inclusion of former English language learners — In many states, the subgroup of English language learners (also known as limited English 
proficient students) includes students who were formerly English language learners but who have achieved English language proficiency or 
fluency in the last two years. Federal NCLB regulations permit states to include these formerly ELL students (sometimes referred to as 
“redesignated fluent English proficient” students) in the ELL subgroup for up to two years for purposes of NCLB accountability. 

Limitations of percentage proficient measure — The percentage proficient, the main gauge of student performance under NCLB, can be easily 
understood and gives a snapshot of how many students have met their state’s performance expectations. But it also has several limitations as a 
measure of student achievement. Users of percentage proficient data should keep in mind these limitations, particularly the following: 

* “Proficient” means different things across different states. States vary widely in curriculum, learning expectations, and tests, and state tests differ 
considerably in their difficulty and cut scores for proficient performance. 

* Although this study has taken steps to avoid comparing test data where there have been “breaks” in comparability resulting from new tests, 
changes in content standards, revised cut scores, or other major changes in testing programs, the year-to-year comparability of test results in 
the same state may still be affected by less obvious policy and demographic changes. 

* Changes in student performance may occur that are not reflected in percentage proficient data, such as an increase in the number of students 
reaching performance levels below and above proficient (such as the basic or advanced levels). 

* The size of the achievement gaps between various subgroups depends in part on where a state sets its cut score for proficiency. For example, if 
a proficiency cut score is set so high that almost nobody reaches it or so low that almost everyone reaches it, there will be little apparent 
achievement gap. By contrast, if the cut score is closer to the mean test score, the gaps between subgroups will be more apparent. 

Difficulty of attributing causes — Although the tables above show trends in test scores since the enactment of NCLB, one cannot assume that 
these trends have occurred because of NCLB. It is always difficult to determine a cause-and-effect relationship between test score trends and any 
specific education policy or program due to the many federal, state, and local reforms undertaken in recent years and due to the lack of an 
appropriate “control” group of students not affected by NCLB. 





